Chat, code, generate images, create videos — all running locally. Plug & Play with 12 local backends. 13 built-in tools, coding agent. No cloud, no accounts.
Download v2.3.1
Plug & Play: choose from 20+ providers. The setup wizard auto-detects 12 local backends (Ollama, LM Studio, vLLM, KoboldCpp, Jan, GPT4All, llama.cpp, and more), and cloud APIs can be configured in Settings. Other platforms: build from source.
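Under the hood, backend detection can be as simple as probing each project's published default port. A minimal sketch (the port map is an assumption based on each backend's defaults; the real wizard may also query API endpoints):

```python
# Sketch: detect running local backends by probing their default ports.
# Ports are the published defaults for each project, not app internals.
import socket

DEFAULT_PORTS = {
    "Ollama": 11434,
    "LM Studio": 1234,
    "vLLM": 8000,
    "KoboldCpp": 5001,
    "Jan": 1337,
    "GPT4All": 4891,
    "llama.cpp server": 8080,
}

def detect_backends(host: str = "127.0.0.1", timeout: float = 0.25) -> list[str]:
    """Return the names of backends listening on their default port."""
    found = []
    for name, port in DEFAULT_PORTS.items():
        with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as sock:
            sock.settimeout(timeout)
            if sock.connect_ex((host, port)) == 0:  # 0 means the port is open
                found.append(name)
    return found

print("Detected:", detect_backends() or "nothing (see install links)")
```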
Web search, file I/O, shell commands, code execution, screenshots, system info. Granular permissions per category. Native + Hermes fallback for any model.
Dedicated coding mode. Reads your codebase, writes code, runs commands. File tree browser, native folder picker, and a configurable working directory. Up to 20 iterations per task.
20+ provider presets. Local: Ollama, LM Studio, vLLM, KoboldCpp, llama.cpp, LocalAI, Jan, and more. Cloud: OpenAI, Anthropic, OpenRouter, Groq, Together, DeepSeek, Mistral. Switch per conversation.
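Most of these presets speak the OpenAI-compatible API, so switching providers largely reduces to swapping a base URL and key. A sketch with the `openai` Python client (URLs, keys, and the model name `llama3` are illustrative, not the app's actual configuration):

```python
# Sketch: one client, many providers, selected per conversation.
from openai import OpenAI

PRESETS = {
    "ollama":    {"base_url": "http://localhost:11434/v1", "api_key": "ollama"},
    "lm_studio": {"base_url": "http://localhost:1234/v1",  "api_key": "lm-studio"},
    "openai":    {"base_url": "https://api.openai.com/v1", "api_key": "sk-..."},
}

def client_for(preset: str) -> OpenAI:
    cfg = PRESETS[preset]
    return OpenAI(base_url=cfg["base_url"], api_key=cfg["api_key"])

reply = client_for("ollama").chat.completions.create(
    model="llama3",  # whichever model the chosen backend serves
    messages=[{"role": "user", "content": "Hello!"}],
)
print(reply.choices[0].message.content)
```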
FLUX 2 Klein, FLUX.1, Z-Image Turbo, Juggernaut XL via ComfyUI. Text-to-Image and Image-to-Image. No content filter. One-click setup.
Wan 2.1, HunyuanVideo 1.5, LTX 2.3, AnimateDiff, FramePack F1 (I2V). Text-to-video and image-to-video on your GPU. No watermarks.
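For the curious: ComfyUI accepts a workflow graph over plain HTTP, which is all a one-click setup needs to drive. A minimal sketch against ComfyUI's default address (the workflow file is a placeholder; export your own graph with ComfyUI's "Save (API Format)" option):

```python
# Sketch: queue a generation by POSTing a workflow graph to ComfyUI.
import json
import urllib.request

def queue_workflow(workflow: dict, host: str = "http://127.0.0.1:8188") -> dict:
    """Submit a workflow to ComfyUI's /prompt endpoint; returns the queue info."""
    req = urllib.request.Request(
        f"{host}/prompt",
        data=json.dumps({"prompt": workflow}).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)

# with open("text_to_image_api.json") as f:   # exported via "Save (API Format)"
#     print(queue_workflow(json.load(f)))
```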
Drag & drop images, Ctrl+V paste screenshots, clip button. Vision models describe what they see. Up to 5 images per message.
7 tool categories (web, filesystem, terminal, system, desktop, image, workflow). Block, confirm, or auto-approve per category and per conversation.
Browse, install, and switch models. Auto-detects model type (text, image, or video). Load and unload models from VRAM. Hardware-aware recommendations.
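Against Ollama, for example, listing and unloading map to two documented REST calls (other backends differ). A sketch assuming Ollama on its default port:

```python
# Sketch: list installed Ollama models and free one from VRAM.
import json
import urllib.request

BASE = "http://localhost:11434"

def list_models() -> list[str]:
    """GET /api/tags returns every installed model."""
    with urllib.request.urlopen(f"{BASE}/api/tags") as resp:
        return [m["name"] for m in json.load(resp)["models"]]

def unload(model: str) -> None:
    """keep_alive=0 asks Ollama to unload the model's weights immediately."""
    payload = json.dumps({"model": model, "keep_alive": 0}).encode("utf-8")
    req = urllib.request.Request(f"{BASE}/api/generate", data=payload,
                                 headers={"Content-Type": "application/json"})
    urllib.request.urlopen(req).read()

print(list_models())
```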
Toggle thinking for any provider. Native support where available, system prompt fallback for others. See the AI's reasoning in collapsible blocks before the answer.
First-launch wizard scans 12 local backends automatically. Ollama, LM Studio, vLLM, KoboldCpp, Jan, and more. Nothing installed? One-click install links for every backend. Zero config.
Dedicated coding mode. Reads your codebase, writes files, runs shell commands. File tree with native folder picker. Up to 20 iterations per task.
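Conceptually, the coding mode is a bounded tool loop: ask the model, run whatever tools it requests, feed the results back, and stop when it answers or hits the cap. A sketch of that shape (`llm` and `run_tool` are hypothetical stand-ins, not the app's internals):

```python
# Sketch: a coding-agent loop capped at 20 iterations.
MAX_ITERATIONS = 20

def run_task(llm, run_tool, task: str) -> str:
    messages = [{"role": "user", "content": task}]
    for _ in range(MAX_ITERATIONS):
        reply = llm(messages)                 # one model call
        if not reply.get("tool_calls"):       # no tools requested: final answer
            return reply["content"]
        messages.append(reply)
        for call in reply["tool_calls"]:      # execute each requested tool
            result = run_tool(call["name"], call["arguments"])
            messages.append({"role": "tool", "name": call["name"],
                             "content": result})
    return "Stopped after 20 iterations."
```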
Dynamic tool registry: web_search, file_read, file_write, shell_execute, code_execute, system_info, screenshot, and more. Native + Hermes fallback.
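The two paths look roughly like this: native tool calling passes JSON schemas through the API's `tools` parameter, while the Hermes fallback describes the same schemas in the system prompt and parses `<tool_call>` blocks out of the reply. A sketch (the `web_search` schema is illustrative):

```python
# Sketch: Hermes-style fallback for models without native tool calling.
import json

TOOLS = [{
    "name": "web_search",
    "description": "Search the web and return top results.",
    "parameters": {"type": "object",
                   "properties": {"query": {"type": "string"}},
                   "required": ["query"]},
}]

def hermes_system_prompt(tools: list[dict]) -> str:
    """Describe the tools in the system prompt, Hermes style."""
    return (
        "You may call these tools. To call one, reply with exactly:\n"
        '<tool_call>{"name": <tool name>, "arguments": <args>}</tool_call>\n'
        f"<tools>{json.dumps(tools)}</tools>"
    )

def parse_tool_call(text: str) -> dict | None:
    """Extract a Hermes-style tool call from raw model output, if any."""
    start, end = text.find("<tool_call>"), text.find("</tool_call>")
    if start == -1 or end == -1:
        return None
    return json.loads(text[start + len("<tool_call>"):end])
```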
7 categories: web, filesystem, terminal, system, desktop, image, workflow. Per-conversation overrides. Block, confirm, or auto-approve.
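A permission gate of that shape is easy to picture. A sketch (the per-category defaults here are invented for illustration; `ask_user` stands in for the confirm dialog):

```python
# Sketch: block / confirm / auto-approve, per category, with overrides.
from enum import Enum

class Decision(Enum):
    BLOCK = "block"
    CONFIRM = "confirm"
    AUTO = "auto-approve"

DEFAULTS = {"web": Decision.AUTO, "filesystem": Decision.CONFIRM,
            "terminal": Decision.CONFIRM, "system": Decision.AUTO,
            "desktop": Decision.CONFIRM, "image": Decision.AUTO,
            "workflow": Decision.CONFIRM}  # illustrative defaults only

def allowed(category: str, overrides: dict[str, Decision], ask_user) -> bool:
    """Per-conversation overrides win over the global defaults."""
    decision = overrides.get(category, DEFAULTS[category])
    if decision is Decision.BLOCK:
        return False
    if decision is Decision.CONFIRM:
        return ask_user(f"Allow a '{category}' tool to run?")
    return True
```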
Attach images via clip button, drag & drop, or Ctrl+V paste. Vision models analyze images. Works across all providers.
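On the wire, attachments are base64 data URLs inside the message content, the shape OpenAI-compatible vision endpoints expect. A sketch:

```python
# Sketch: build a vision message with up to 5 attached images.
import base64
from pathlib import Path

def image_message(text: str, image_paths: list[str]) -> dict:
    parts = [{"type": "text", "text": text}]
    for path in image_paths[:5]:  # the app caps attachments at 5 per message
        b64 = base64.b64encode(Path(path).read_bytes()).decode("ascii")
        parts.append({"type": "image_url",
                      "image_url": {"url": f"data:image/png;base64,{b64}"}})
    return {"role": "user", "content": parts}
```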
Provider-agnostic. Native support where available, system prompt fallback for others. Collapsible thinking blocks in chat.
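The fallback can be as simple as prompting the model to wrap its reasoning in `<think>` tags and splitting them out client-side (reasoning models like DeepSeek-R1 emit this format natively). A sketch:

```python
# Sketch: split <think> reasoning from the visible answer.
import re

THINK_RE = re.compile(r"<think>(.*?)</think>", re.DOTALL)

def split_thinking(text: str) -> tuple[str, str]:
    """Return (reasoning, answer) from raw model output."""
    thoughts = "\n".join(m.strip() for m in THINK_RE.findall(text))
    answer = THINK_RE.sub("", text).strip()
    return thoughts, answer

reasoning, answer = split_thinking("<think>2 + 2 is 4.</think>The answer is 4.")
print(reasoning)  # collapsible block in the UI
print(answer)     # the visible reply
```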
| Feature | Locally Uncensored | Open WebUI | LM Studio | SillyTavern |
|---|---|---|---|---|
| AI Chat | Yes | Yes | Yes | Yes |
| Plug & Play Setup | 12 Backends | No | Built-in | No |
| Multi-Provider | 20+ Presets | Yes | Yes | No |
| Coding Agent | Codex | No | No | No |
| Agent Tools (MCP) | 13 Tools | No | No | No |
| Image Generation | Yes | No | No | No |
| Image-to-Image | Yes | No | No | No |
| Video Generation | Yes | No | No | No |
| Image-to-Video | Yes | No | No | No |
| File Upload + Vision | Yes | Yes | Yes | No |
| Thinking Mode | Yes | No | No | No |
| Granular Permissions | 7 Categories | No | No | No |
| A/B Model Compare | Yes | No | No | No |
| Local Benchmark | Yes | No | No | No |
| Uncensored by Default | Yes | No | No | Partial |
| Open Source | AGPL-3.0 | MIT | No | AGPL-3.0 |
| No Docker | Yes | No (requires Docker) | Yes | Yes |
Download the .exe and install. That's it. No Docker, no terminal, no config files.
On first launch, the app scans for all 12 local backends automatically — Ollama, LM Studio, vLLM, KoboldCpp, Jan, and more. Found something? Connected. Nothing running? The wizard shows every backend with one-click install links.
Pick a model, start chatting. Switch to Codex for coding. Open Create for images and video. Add more providers anytime in Settings.
Google flagship. Native tools + vision. Apache 2.0. E4B runs on 4 GB, 27B on 16 GB. Recommended in onboarding.
Strongest reasoning + coding. 256K context. Abliterated variants available. 8-22 GB VRAM.
Next-gen + classic FLUX. 8-10 GB VRAM. Text-to-Image and Image-to-Image.
Explicitly uncensored. 8-15 seconds per image. No safety filters. T2I + I2I. 10-16 GB VRAM.
Text-to-video. Wan 1.3B (8 GB) for speed, 14B (12+ GB) for quality. HunyuanVideo for consistency.
Image-to-video on just 6 GB VRAM. Upload an image, get video. Revolutionary next-frame prediction.
A free, open-source desktop app for running AI locally. Combines uncensored chat, a coding agent (Codex) with 13 tools, image generation (ComfyUI), and video creation in one interface. 20+ provider presets: Ollama, LM Studio, vLLM, KoboldCpp, llama.cpp, LocalAI, Jan, OpenAI, Anthropic, OpenRouter, Groq, and more. AGPL-3.0 licensed.
Yes. After setup and model download, no internet needed. No accounts, no telemetry, no usage limits. Cloud providers are optional — the core runs 100% locally.
Those tools handle text chat. Locally Uncensored adds a coding agent with 13 MCP tools, image generation, video creation, A/B model comparison, local benchmarking, granular permissions, file upload with vision, and thinking mode. All in one app.
Text chat: 8 GB RAM. Images: NVIDIA GPU with 8+ GB VRAM. Video: 10-12 GB VRAM. The app recommends models based on your hardware. Windows 10/11.
Abliterated models have their artificial restrictions removed, so the AI responds honestly without refusals or disclaimers. Combined with local execution, your conversations are completely private.
All sizes from E4B to 27B. Native tools, vision, uncensored variants.
Upload a photo, adjust denoise, transform. FLUX, Z-Image, SDXL.
Complete comparison of GPT4All, Open WebUI, LM Studio, Jan, and more.
Setup guide. Models, hardware, and why local beats cloud.
Both open source. Only one does chat + code + images + video.
Open source all-in-one vs polished closed-source chat client.
Free, open source, and yours to keep.